Understanding Digital Documents Using Gestalt Properties of Isothetic Components

Authors

  • Shyamosree Pal
  • Partha Bhowmick
  • Arindam Biswas
  • Bhargab B. Bhattacharya
Abstract

This paper shows how Gestalt properties can be used to identify the various components of a document image. The observation that human vision works holistically rather than piecewise is shown to be useful for document analysis. Since the major constituent components (textual or non-textual) of a document page are arranged in a rectilinear fashion, a rectilinear (isothetic) decomposition of the components on the page is performed. The page is first represented as a feature set of polygonal covers corresponding to its distinct regions of interest. Each polygon is then iteratively decomposed into sub-polygons that tightly enclose the corresponding sub-components, capturing both the overall information and the necessary details to the desired level of precision. Subsequently, these components and sub-components are analyzed using Gestalt laws/properties, which are explained in detail in the context of this work. Text regions, tabular structures, and various graphic objects readily admit some of the Gestalt properties. We have tested our algorithm on several benchmark datasets, and we report some relevant results to demonstrate the effectiveness and elegance of the proposed method.
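The coarse-to-fine decomposition described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes the page is given as a list of foreground pixel coordinates, uses the union of occupied grid cells as a stand-in for a tight isothetic cover, and groups cells by 4-connectivity as a crude proxy for the Gestalt law of proximity. The function names and the halving of the grid size are illustrative assumptions.

```python
from collections import deque


def occupied_cells(pixels, g):
    """Return the g x g grid cells that contain at least one foreground pixel.

    The union of these cells is an axis-parallel (isothetic) region.
    """
    return {(r // g, c // g) for r, c in pixels}


def cell_components(cells):
    """Group grid cells into 4-connected components (proximity grouping)."""
    remaining = set(cells)
    components = []
    while remaining:
        seed = remaining.pop()
        comp = {seed}
        queue = deque([seed])
        while queue:
            r, c = queue.popleft()
            for nb in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
                if nb in remaining:
                    remaining.remove(nb)
                    comp.add(nb)
                    queue.append(nb)
        components.append(comp)
    return components


def decompose(pixels, g, min_g):
    """Build a region tree by covering the pixels with ever finer grids.

    Each node records the grid size and the cells of one connected cover;
    its children are the sub-regions found at half the grid size.
    """
    nodes = []
    for comp in cell_components(occupied_cells(pixels, g)):
        region = [p for p in pixels if (p[0] // g, p[1] // g) in comp]
        children = decompose(region, g // 2, min_g) if g // 2 >= min_g else []
        nodes.append({"grid": g, "cells": sorted(comp), "children": children})
    return nodes
```

With two small blobs ten columns apart, a call such as `decompose(pixels, 16, 4)` yields a single region at grid size 16 that separates into two sub-regions once the grid is refined to size 4, which mirrors the idea of capturing overall layout first and details later.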


Similar resources

Supercover model, digital straight line recognition and curve reconstruction on the irregular isothetic grids

HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau...


Special Issue: Discrete Geometry for Computer Imagery Supercover model, digital straight line recognition and curve reconstruction on the irregular isothetic grids

On the classical discrete grid, the analysis of digital straight lines (DSL for short) has been intensively studied for nearly half a century. In this article, we are interested in a discrete geometry on irregular grids. More precisely, our goal is to define geometrical properties on irregular isothetic grids that are tilings of the Euclidean plane with different-sized axis-parallel rectangles....


Angle counts for isothetic polygons and polyhedra

In the case of isothetic simple polyhedra there are only six different types of 3D angles. This article states and proves a formula about counts of these angles. This complements formulas in combinatorial topology such as Euler's polyhedron formula, or the previously known formula on angle counts for isothetic polygons. The latter formula and the shown equality for angle counts of isothetic sim...


Arc Recognition on Irregular Isothetic Grids and Its Application to Reconstruction of Noisy Digital Contours

In the present paper, we introduce an arc recognition technique suitable for irregular isothetic objects. It is based on the digital inter-pixel (DIP) circle model, a pixel-based representation of Kovalevsky’s circle. The adaptation to irregular image structurations allows us to apply DIP models for circle recognition in noisy digital contours. More precisely, the noise detector from Keraut...


Investigating the Effect of Pigment and Solvent Components on the Physical Properties of Digital Ink for the Decoration of Ceramic Tiles

The inkjet printing machine was initially used on ceramic parts only in a limited way, as a small device in design and advertising offices. In recent years, it has been applied professionally in industry to create high-quality patterns. In this method, liquid ceramic ink is sprayed on the specified parts by piezoelectric nozzles and after drying and discharging, the organic matter is baked a...



Journal:
  • IJDLS

Volume 1  Issue 

Pages  -

Publication date: 2010